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1.  INTRODUCTION 

If  the  wind  components  at  one  pressure  level  of  the  atmosphere  are  given 
and  the  vertical  wind  shear  is  known  between  the  given  level  and  another 
level,  the  winds  at  the  second  level  can  be  inferred.   Over  data-sparse 
regions  of  the  earth,  such  as  the  oceans,  winds  usually  are  known  at  only  a 
few  levels  because  the  data  are  limited  to  aircraft  reports,  cloud  motion 
vectors  derived  from  time-lapse  sequences  of  satellite  cloud  pictures,  or 
constant  pressure-level  balloons.   However,  if  in  addition  the  vertical  wind 
shear  were  known,  then  the  winds  could  be  determined  at  intermediate  levels 
as  well.   It  is  the  purpose  of  this  report  to  examine  from  a  theoretical 
point  of  view  the  possibility  of  deriving  the  vertical  shear  of  the  geostrophic 
wind  from  horizontal  gradients  of  satellite-measured  radiances. 

Work  of  this  kind  has  been  done  by  Zak  and  Panofsky  (1968)  and  by  Grody, 
et  al.  (1979),  but  in  both  papers  the  authors  used  radiances  from  only  one 
spectral  interval  (channel).   It  is  shown  in  this  report  that  the  use  of 
linear  combinations  of  radiances  from  many  channels  yields  greater  accuracy. 
Furthermore,  an  extensive  study  is  made  to  determine  the  best  pressure  inter- 
vals to  use  for  the  Special  Sensor-H  (SSH)  sounder  of  the  Defense  Meteorologi- 
cal Satellite  Program  (DMSP)  and  for  the  Tiros-N  Operational  Vertical  Sounder 
(TOVS)  instrument  package  on  the  TIROS-N/NOAA  series  of  operational  satellites. 
Once  the  optimum  pressure  interval  is  determined,  one  of  the  boundaries  of 
the  interval  becomes  the  pressure  level  at  which  the  winds  must  be  known  and 
the  other  boundary  becomes  the  pressure  level  at  which  the  winds  can  be 
inferred. 

2 .  THEORY 

The  relationship  between  the  vertical  atmospheric  temperature  profile  as 
a  function  of  pressure  T(p)  and  the  corresponding  radiance  R  emerging  from 


the  top  of  the  atmosphere  and  measured  by  satellite  over  a  wavenumber  inter- 
val (channel)  centered  at  v  is  given  by  the  radiative  transfer  equation 


Ps 

dT  (p) 

B^  [T(p)J  [-  -^ ]  dp.  (1) 


In  Eq.  (1)  B  (T)  is  the  Planck  function  at  wavenumber  V  and  temperature  T, 
T  (p)  is  the  fractional  atmospheric  transmittance  between  the  pressure  level 
p  and  the  effective  top  of  the  atmosphere  p  ,  and  p   is  the  surface  pressure. 
Note  that  the  usual  boundary  term  in  Eq.  (1)  is  missing  because  only  those 
channels  that  do  not  sense  the  ground  radiation  are  used.   The  reason  for 
this  is  that  it  would  be  difficult  to  interpret  the  gradient  of  such 
radiances.   Furthermore,  the  geostrophic  wind  equation  is  not  valid  in  the 
boundary  layer  because  of  surface  frictional  forces. 

Another  relationship  that  will  be  needed  is  the  one  between  the  geo- 
strophic wind  shear  and  the  horizontal  gradient  of  temperature  which  is 
given  by  the  thermal-wind  equation 


"^  '^  IW '  'p'-  <'' 


where  V  is  the  geostrophic  wind  vector,  k  is  the  vertical  unit  vector,  V 

is  the  gradient  operator  on  the  surface  of  constant  pressure,  and  the  scalar 

a  =  f/R 

is  the  ratio  between  the  coriolis  parameter  f  and  the  gas  constant  for  air 
R. 

To  make  Eqs.  (1)  and  (2)  compatible,  one  can  choose  a  fixed  vertical 
temperature  profile  T   that  is  sufficiently  representative  of  the  area  to 
which  the  gradient  operator  is  being  applied  that  the  following  Taylor 


approximation  is  valid: 


dB  (T  ) 


Therefore,  if  Eq.  (4)  is  applied  to  Eq.  (1)  and  the  isobaric  gradient 

operator  V   is  applied  to  the  resulting  equation,  one  obtains 
P 


t 


^T 


where  the  kernel  function 


dB  [T  (t)3   dT  (t) 
v^^^    ~  dT(t)      dt   d(£np)'  ^^^ 


and  the  variable  of  integration  p  has  been  replaced  by  the  transformation 

t  =  a  p^/^  +  b  (7) 

in  which  a  and  b  are  appropriate  constants.   (The  purpose  in  making  the 

transformation  (7)  is  to  distribute  the  kernel  functions  more  uniformly 

throughout  the  vertical  extent  of  the  atmosphere,  since  they  determine  the 

levels  of  the  atmosphere  from  which  the  radiation  is  emitted.)   The  limits 

of  integration  t   and  t   in  (5)  are  the  transformed  surface  pressure  and 

effective  top  of  the  atmosphere,  respectively.   Note  that  terms  in  (4) 

involving  T  vanish  when  the  gradient  operator  is  applied  to  them  because 

they  are  constant  over  the  horizontal  space  coordinates. 

Eqs.  (2)  and  (5)  are  now  related  through  V  T,  that  is, 

P 

t 
s 


J  '^      ^ 


^T 


which  is  one  of  the  fundamental  equations  we  will  need.   A  second  basic 
result  is  obtained  by  integrating  (8)  by  parts  to  obtain 


4: 
^  -.          dK  (t) 
VpR^  =  -ak  X  I    V(t)  -^^ dt,  ■:   (9) 


a; 

/ 


^T 


where  the  boundary  term  vanishes  because  K  (t)  vanishes  at  t   and  at  t^. 

■'  V  s         T 

One  now  can  work  with  either  Eq .  (8)  or  (9). 

The  approach  of  Zak  and  Panofsky  (1968)  will  be  followed  first  by  con- 
sidering Eq.  (9).   Since  the  kernel  function  K  (t)  somewhat  resembles  a 
normal  density  probability  function  in  that  it  has  a  single  peak  value  and 
falls  off  to  zero  on  each  side  of  its  peak  value,  its  derivative  has  a 
bimodal  shape.   An  example  of  such  a  curve  is  given  in  Fig.  1  by  the  solid 
curve  labeled  D- .   The  shape  of  curve  D^  is  typical  of  the  function  dK  (t)/dt  in 
that  it  has  well  defined  positive  and  negative  modes  which  permit  Eq .  (9) 
to  be  written  in  the  form 


t  t 

.c       ,„  ,..       .s 


V  R  =  -ak  X  {   I    V(t)  — ~ —  dt  -  /    V(t) 


V 


dt 


dt   ,  (10) 


where  t   is  the  zero-crossing  level.  Since  the  differentiated  kernels  in  (10) 
are  both  positive,  the  second  mean-value  theorem  for  integrals  applies,  and 
one  can  write  Eq .  (10)  in  the  form 

ApR^  =  6k  X  [V(t^)  -V(t2)]  =  6k  X  6V,  (11) 


where 


3  =  [-aK^(t^)]"^  ,  (12) 


since  K  (t  )  =  K  (t  )  =  0.   All  that  is  known  about  the  arguments  t   and  ty 
is  that  they  are  in  the  intervals  [t  ,  t  ]  and  [t  ,  t  ],  respectively. 
However,  since  the  differentiated  kernel  functions  have  well-defined  peaks 
in  each  of  these  intervals  (see  curve  D^  in  Fig.  1),  reasonable  approxima- 
tions for  t^  and  t„  are  the  pressure  levels  of  these  peaks. 

Since  6V  can  be  determined  from  Eq.  (11),  one  can  write  the  required 
geostrophic  wind  components  as  follows: 

9R 
u(t^)  =  u(t2)  +  6u  =  u(t2)  +  3(-g^)p  (13) 

8R 
v(t^)  =  v(t2)  +  6v  =  v(t2)  -  3(9^)p.  (14) 


where  the  wind  components  at  level  t^  are  inferred  from  those  given  at 
level  t   and  from  the  components  of  the  radiance  gradient.   Eqs.  (13)  and 
(14)  are  the  results  obtained  by  Zak  and  Panofsky  (1963) ;  however  something 
more  can  be  done  to  improve  them. 

The  two  modes  of  curve  D^  in  Fig.  1  are  so  broad  that  there  is  a  high 
probability  that  the  components  of  the  radiance  gradient  do  not  represent 
the  geostrophic  wind  component  differences  at  levels  t,  and  t„  as  determined 
by  the  peak  values.   Clearly,  what  is  required  is  that  the  two  modes  of  the 
bimodal  kernel  function  derivative  be  made  to  approach  Dirac  delta  functions 
(distributions)  as  clearly  as  possible.   A  method  for  achieving  this  is 
provided  by  a  variant  of  the  Backus-Gilbert  method  (Backus  and  Gilbert,  1970) 
(Also  see  Conrath,  1972.) 


10 


3.   MODIFIED  BACKUS-GILBERT  METHOD  "■ 

Since  radiances  are  measured  in  a  number  of  channels,  it  is  reasonable 
to  use  all  of  those  radiances  that  do  not  have  a  component  of  ground  radia- 
tion in  a  linear  combination  to  accomplish  the  objectives  described  in  the 
preceding  section.   Begin  by  replacing  the  continuous  variable  v  in  Eq .  (9) 
by  the  index  i  to  signify  the  ith  channel,  that  is,  write  (9)  in  the  form 


.    I  '   -    dK.(t) 

r.  =  V  R.  =  -ak  X  /     V(t)  —^ dt,     i=l,...,n.         (15) 

1    p  1         /  dt      '        '    ' 


/ 


If  one  now  multiplies  each  of  these  n  equations  by  a  yet-unspecified  coeffi- 
cient c.  and  then  sums  the  resulting  n  equations,  the  following  linear  com- 
bination is  obtained: 


dK.(t) 


/     '^'^^      ^i^ 
J  i=l 


c^r^  =  -ak  X   /    V(t)[>  .  c,  —^f ]  dt.  (16) 


T 


The  quantity  in  square  brackets  is  called  the  averaging  kernel  and  will  be 
denoted  by  D,  that  is, 

A      dK  (t) 
D(c,t)  =2]   c.^^— .  (17) 

i=l 

One  is  free  to  choose  the  coefficients  c.  of  (16)  in  any  reasonable 
manner;  therefore,  they  should  be  chosen  so  as  to  give  the  averaging  kernel 
(17)  the  following  properties: 
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1.  It  should  be  bimodal  with  a  positive  and  a  negative  lobe,  each  of 
which  approximates  a  Dirac  delta  function  as  closely  as  possible  at 
predesignated  pressure  levels  t^  and  t„. 

2.  The  two  lobes  should  each  have  unit  area,  but  the  areas  should  be 

of  opposite  sign,  and  the  crossover  point  should  be  at  t  =  (t,+t^)/2, 
The  first  property  is  realized  by  minimizing  the  quadratic  form 


t 

Q(c)  =   /    (t-t^)^t-t2)^[D(c,t)-6(t-tp+6(t-t2)]^dt 
t„ 


/ 


^T 


t 
s 


(t-t^)^t-t2)^  D^c,t)  dt  (18) 


-^  2  2 

with  respect  to  the  vector  c.   The  factors  (t-t,)   and  (t-t  )   in  (18)  are 

introduced  to  play  the  role  of  penalty  functions.   They  are  penalty  functions 

because  they  cause  Q  to  grow  rapidly  in  magnitude  if  the  squared  quantity  in 

brackets  deviates  substantially  from  zero  away  from  the  points  t^  and  t„. 

Thus,  Q  is  penalized  least  if  D  is  a  maximum  in  the  vicinity  of  t   and  t„ 

and  a  minimum  elsewhere.   The  second  property  listed  above  translates  into 

the  two  constraints 


/    D(c,t)  dt  =  -  /    D 

T  t 

c 


(c,t)  dt  =  1  (19) 


and 


D(c,t^)  =  0,  (20) 
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with  t  =  (tT+t„)/2. 

->• 
We  now  wish  to  determine  the  vector  c  by  minimizing  the  quadratic  form 

(18),  subject  to  the  constraints  (19)  and  (20).   This  is  a  classical  prob- 
lem that  is  readily  solved  by  introducing  a  pair  of  Lagrangian  multipliers 
A  and  y  and  combining  Eqs.  (18),  (19)  and  (20)  into  a  single  expression, 
defined  by  Q,  that  must  be  minimized,  where 

Q(c)  =  c^Sc  +  A(c'^a-l)  +  y(c'^b).  (21) 

The  elements  of  the  matrix  S  and  the  vectors  a  and  b  are,  by  (17),  (18), 
(19)  and  (20),  respectively, 


t 

/^  dK  (t)  dK.(t) 

(^-^l)  (^-4)  -±—^^^^^  (22) 

4 


'ij    J     ^^  ^1^  ^^  ^2'        dt     dt 


t  t 

■^   dK.(t)         /   dK.(t) 


/dK  U)          r        dK.  (t) 
-^ dt  =  -  /    —7^^ dt  =  K.(t  ) 
dt           I     dt  1^  c' 


(23) 


to,  t' 

T  c 


and 


dK.(t  ) 
\--k^  (2A) 


for  i,j=l,...,n.   The  coefficient  vector  c  which  minimizes  expression  (21) 
satisfies  the  normal  equation  '  '  • 


-^  =  2Sc  +  Xa  +  yb  =  0, 
do 


or 


c  =  -  I  S  ^(Aa  +  Mb).  (25) 
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Multiplying  c  by  a  and  b,  one  satisfies  constraints  (19)  and  (20),  that 
is, 

c  a  =  -  ^(Aa  +  yb  )  S  a  =  1  (26) 

and 

t^t   -  -  i(X:^  +  Mb^)  S-^b  =  0.  (27) 


If  one  sets 

a  -  4  th-^t,   B  =  -  i  th-H   =  -  i  b^S-lJ,  Y  .  -  i  S^S-^E,     (28) 


then  (26)  and  (27)  reduce  to  the  pair  of  simultaneous  equations 

aX  -  3y  =  1 
gA  +  yy  =  0, 

/ 
whose  solutions  are 

A  =  -^ and       y  =  -y-^ ,  (29) 

3  -  ay  3  -  ay 

2 
provided  3  -  ay  ?^  0.   Substitution  of  (28)  into  (29),  and  in  turn  (29)  into 

->- 
(25),  yields  the  required  coefficient  vector  c. 

We  will  now  have  to  digress  somewhat  because  when  the  calculations  of 

Eqs.  (22)  through  (29)  were  performed  with  real  data  disaster  struck  quickly, 

The  difficulty  was  not  with  the  theory,  but  with  an  unexpected  practical 

matter,  namely,  the  system  of  equations  (26)  and  (27)  was  singular.   In 

2 
other  words,  3  -  ay  =  0  in  solution  (29),  or  nearly  so.   The  reason  this 

happened  was  that  for  most  of  the  cases  tried, the  zero  crossover  naturally 

fell  at  the  point  (t^+t„)/2,  without  having  to  stipulate  it  as  such  through 


14 


Eq.  (20).   Hence,  the  constraint  (20)  was  redundant  with  what  was  already 
an  accomplished  fact  and  the  system  became  singular.   To  rectify  the  situa- 
tion it  is  necessary  to  delete  constraint  (20) ,  which  resulted  in  the 
following  changes.   (The  cases  in  which  the  system  was  not  singular  will 
be  discussed  later.) 

First  expression  (21)  takes  the  form 

^  -)-    -vT  -l->  ^T-^ 

Q(c)  =  c  S   c  +  A(c  a-1),  (30) 

which  when  minimized  has  the  solution 

-»■     A  -1^ 

c  =  -  Y  S   a  (31) 

[see  (25)],  so  that  '  .- 

ca  =  -  —  aS  a  =1  (32) 


by  (19),  that  is. 


?  =  :ri:r.  (33) 

a  S   a 


which  is  the  required  coefficient  vector. 

If  the  coefficients  obtained  from  Eq .  (33)  are  now  used  in  Eq .  (17), 
an  explicit  expression  for  the  averaging  kernel  D  is  obtained.   An  example 
of  such  a  curve  is  plotted  as  the  dashed  curve  in  Fig.  1  for  which  n  =  4. 
The  curve  D  was  chosen  particularly  because  its  negative  lobe  best  coin- 
cides with  the  negative  lobe  of  the  solid  curve  labeled  D  .   Recall  that 
D  ,  which  was  described  previously,  is  based  on  a  single  kernel  function. 
In  order  to  make  the  proper  geometric  comparison  between  curves  D^  and  D„, 
they  each  had  to  be  normalized  by  dividing  the  original  functions  by  the 
maximum  of  their  absolute  values. 
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Two  important  facts  emerge  when  comparing  the  single  channel  approach, 
as  exemplified  by  curve  D  ,  with  the  multichannel  approach,  as  illustrated 
by  curve  D„.   First,  both  lobes  of  D^  are  much  narrower  than  those  of  D^ ; 
hence,  the  mean-value  assumption  required  to  obtain  Eq.  (11)  is  more 
realizable  for  the  multichannel  case.   Second,  the  levels  at  which  the 
positive  and  negative  lobes  of  D„  peak  are  much  closer  together  than  they 
are  for  the  two  lobes  of  D^ .   This  permits  the  wind  vector  difference 
V(t^)  -  V(t^)  in  (11)  to  be  calculated  more  accurately. 

When  Eq.  (16)  and  the  bimodal  averaging  kernel  (17)  are  used  in  place 
of  Eq.  (10),  one  obtains  the  new  result 


->■ 
c.  r . 
1  1 


-ak  x^ 


i=l 


f   v(t)D(c,t)dt  -   r   V(t: 


D(c,t) 


dt 


(34) 


and  then  Eq.  (11)  takes  on  the  form 


c  .r .  = 
1  1 


-ak  X  [V(t  )  -  V(t  )J=  -ak  x 


6V. 


i=l 


(35) 


The  required  geostrophic  wind  components  now  have  the  more  accurate  form 


and 


1^     'h 


(36) 


i=l 


v(t.)  =  v(t.)  +^y^  c.    (^ 
1       2.         a  X  ^  X  dx 


(37) 


i=l 
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As  before,  the  wind  components  at  level  t   are  inferred  from  those  given  at 
level  t^  and  from  the  components  of  the  radiance  gradients,  but  now  there 
is  a  linear  combination  of  radiance  gradients  in  place  of  a  single  gradient 
component . 


4.   BEST  LAYERS  FOR  OPERATIONAL  SOUNDERS 

Of  course  the  purpose  in  deriving  Eqs.  (36)  and  (37)  is  to  apply  these 
results  to  actual  satellite  data.   More  is  said  on  this  subject  subsequently, 
but  in  this  section  we  wish  to  take  advantage  of  the  diagnostic  tools 
afforded  by  the  averaging  kernel  technique.   Since  the  levels  t^  and  t^  at 
which  the  bimodal  averaging  kernel  lobes  peak  are  specified  by  the  user, 
averaging  kernels  can  be  calculated  for  all  possible  combinations  of  pairs 
of  levels  out  of  a  given  finite  set  of  levels  (t,  }.   From  this  collection 

K. 

of  calculations  one  can  determine  the  layers  over  which  the  thermal  winds 
can  be  optimally  derived.   The  outcome  depends  on  the  particular  sounding 
instrument  being  used,  that  is,  on  the  shapes  and  numbers  of  kernel  functions 
that  are  available. 

Optimum  layers  were  determined  for  the  two  currently  operational  sounders, 
namely,  the  DMSF-SSH  sounder  and  the  NOAA-TOVS  sounder.   The  SSH  has  six 
channels  in  the  15-ym  C0„  absorption  band  that  can  be  used  for  temperature 
sounding,  but  since  two  of  these  channels  sense  the  ground,  only  4  channels 
can  be  used  to  derive  vertical  wind  shear.   They  are  centered  at  668.5,  677, 
695,  and  708  cm   .   On  the  other  hand,  the  TOVS  sounder  contains  three 
separate  instrument  packages:  (1)  the  High  Resolution  Infrared  Radiation 
Sounder  Number  2  (HIRS/2)  with  20  channels  primarily  in  the  infrared  region 
of  the  spectrum,  (2)  the  Microwave  Sounding  Unit  (MSU)  with  4  channels  in 
the  5.5-mm  oxygen  complex,  and  (3)  the  Stratospheric  Sounding  Unit  (SSU) 
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with  3  channels  that  employ  a  selective  absorption  technique  in  the  15-)Jin 
C0„  band  to  sound  the  middle  stratosphere. 

Two  separate  tests  were  conducted.   The  first  test  used  only  the  four 
most  absorbing  SSH  channels  in  Eq.  (17)  to  obtain  the  bimodal  averaging 
kernels.   In  order  to  determine  the  optimum  pressure  interval  over  which 
to  calculate  the  wind  shear,  averaging  kernels  for  A5  different  combina- 
tions of  pressure  levels  t^  and  t»  were  computed.   There  was  no  need  to 
calculate  averaging  kernels  for  additional  combinations  of  pressure  levels 
because  the  results  from  neighboring  levels  changed  by  only  small  amounts 
and  in  a  predictable  manner.   A  search  was  made  of  the  45  calculated 
averaging  kernels  for  the  best  ones.   The  criterion  of  "best"  is  those 
that  have  the  narrowest  main  lobes  while  at  the  same  time  have  minimal 
side  lobes.   After  the  search  was  completed  two  cases  were  found  to  be 
superior  to  all  the  others  in  satisfying  the  criterion  of  "best".   The 
averaging  kernels  of  these  two  cases  are  shown  in  Fig.  2  and  are  labeled 
D-  and  D, ;  they  also  were  normalized  as  before  for  comparative  purposes. 
One  concludes  from  these  two  curves  that  the  best   pressure  intervals 
are  from  12  to  125  mb  and  from  80  to  400  mb . 

Expression  (6)  for  the  kernel  functions  indicates  a  temperature  depen- 
dence; therefore,  kernel  functions  were  calculated  for  9  very  different 
U.S.  Standard  Atmosphere  Supplements  (1966).   However,  the  same  two  pairs 
of  layers  that  were  just  given  were  found  to  be   best   in  each  case. 
Although  the  averaging  kernels  for  a  fixed  pair  of  levels  t,  and  to  differed 
in  shape  for  the  9  temperature  profiles  studied,  the  differences  were  only 
in  the  fine  structure  -  the  basic  shape  remained  the  same.   (The  averaging 
kernels  shown  in  Fig.  2  are  for  the  45°N-July  Supplementary  Atmosphere.) 
Of  course,  the  reason  for  this  is  that  the  basic  shape  of  the  kernel  func- 
tion given  by  (6)  is  dictated  by  the  weighting  function  dl  (t)/dt. 
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The  second  test  used  a  total  of  11  channels  from  the  TOVS  instrument 
in  Eq.  (17)  to  obtain  the  bimodal  averaging  kernels.   None  of  the  MSU 
channels  were  used  in  this  test  because  the  field  of  view  is  much  larger 
than  that  of  the  HIRS/2  instrument  and,  therefore,  could  affect  the 
accuracy  of  the  horizontal  radiance  gradients,  relative  to  those  for  the 
other  instruments.   The  SSU  also  has  a  larger  field  of  view  than  that  of 
the  HIRS/2  instrument,  but  because  the  stratosphere  is  believed  to  be  more 
horizontally  homogeneous  than  the  troposphere  is,  the  three  SSU  channels 
were  used  along  with  8  of  the  most  opaque  of  the  HIRS/2  channels.   The 
HIRS/2  channels  used  were  centered  at  668.5,  680,  690,  703,  716,  2240, 
2270,  and  2360  cm""*". 

The  same  45  pairs  of  levels  t^  and  t„  as  were  used  before  were  used 
in  calculating  the  bimodal  averaging  kernels  for  the  11  TOVS  channels. 
This  time  the  search  for   best    levels  lead  to  three  cases  which  are  shown 
in  Fig.  3  as  curves  D^,  D,  and  D.,,  which  were  normalized  as  before  for 
comparative  purposes.   These  3  curves  are  more  "wiggly"  than  those  shown 
in  Fig.  2  for  two  reasons:  (1)  they  are  based  on  a  midlatitudinal  tempera- 
ture profile  that  had  more  fine  structure  than  the  supplemental  temperature 
profile  used  for  Fig.  2,  and  (2)  as  the  number  of  kernel  functions  used  in 
the  linear  combination  (17)  is  increased,  more  structure  is  possible  in 
the  resulting  averaging  kernel.   Calculations  of  averaging  kernels  were  also 
made  for  a  tropical  and  a  polar  temperature  profile  for  the  11  channels,  but 
again  the  averaging  kernels  were  sufficiently  similar  to  those  plotted  in 
Fig.  3  that  the  conclusions  about  the  best    levels  remain  the  same. 

The  conclusions  to  be  drawn  from  Fig.  3  are  that  the  best  pressure 
intervals  for  the  TOVS  sounder  (minus  the  microwave  channels)  are  from  25 
to  150  mb,  from  80  to  325  mb ,  and  from  250  to  625  mb .   As  would  be  expected. 
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(38) 


The  intent  here  is  again  to  use  a  modified  Backus-Gilbert  method  to  produce 
optimal  averaging  kernels,  but  because  the  derived  quantity  in  (38)  is 
vertical  wind  shear  instead  of  the  geostrophic  wind  vector,  different 
optimization  criteria  are  required. 

Following  the  development  in  Section  3,  one  multiplies  each  of  the  n 
equations  in  (38)  by  an  undetermined  coefficient  c.  and  then  sums  the  result- 
ing n  equations  to  obtain  the  single  equation: 


c .  r. 
1  1 


ak  X 


i=l 
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9t 
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c.K.(t) 


dt 


(39) 


The  quantity  in  square  brackets,  though  different  from  that  of  Eq .  (16),  is 
again  an  averaging  kernel  which  will  be  denoted  by  R,  that  is. 


R(c 


'^)  ^Zi^i'^i 


At) 


(40) 


i=l 


This  time  the  choice  of  coefficients  has  a  different  motivation.   We  are 
interested  in  determining  the  average  wind  shear  between  levels  t   and  t„, 
that  is, 


.9V 


1>   =        1     f        3V 
t     t^-t        I  9t 


dt 


(41) 
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Therefore,  if  in  Eq ,  (39)  the  averaging  kernel  were  a  rectangular  unit  func- 
tion defined  by 


{1   for  t^  <  t  <  t^  J 
0   otherwise       j 


p(t;t^,t2)  =  <  >,  (42) 


then  the  mean  value  theorem  applied  to  Eq.  (39)  would  yield  the  average 
wind  shear  defined  by  (41) . 

As  a  consequence  of  the  foregoing  the  averaging  kernel  (40)  should  have 
the  following  properties: 

1.  It  should  approximate  the  rectangular  unit  function  p,  as  defined 
by  (42)  ,  as  closely  as  possible  for  predesignated  pressure  levels 
t   and  t„. 

2.  The  nonzero  portion  of  the  averaging  kernel  (i.e.,  between  t^  and  t^) 
should  have  a  uniform  height  of  unity. 

The  first  property  is  realized  by  minimizing  the  quadratic  form 


t 
s 

Q(c)  =  j      [l-p(t;t^,t2)]^  [R(c,t)  -  p(t;t^,t2)]^  dt 


^T 


'^l  ^ 


s 


I  R^(c,t)dt  +  f        R^(c,t)dt 

t^  ^2 


(43) 


with  respect  to  the  vector  c.   The  factor  [1-p]   is  a  penalty  function.   The 
second  line  of  Eq .  (43)  says  that  the  portion  of  the  averaging  kernel  outside 
the  rectangle  should  have  a  minimum  value.   The  second  property  listed  above 
cannot  be  put  into  a  form  that  can  be  manipulated  analytically.   However, 
a  partial  solution  to  this  difficulty  is  provided  by  the  constraint  that  the 
averaging  kernel  between  t   and  t^  have  an  average  height  of  unity,  that  is. 
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that 
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h-h   J 


R(c,t)dt  =  1 .  (44) 

^1 


The  determination  of  the  vector  c  by  minimizing  the  quadratic  form  (43) , 
subject  to  the  constraint  (44),  is  again  the  problem  given  by  Eqs.  (30), 
(31) ,  and  (32) ,  whose  solution  is  given  by 

^    S"  a  .... 

a  S  a 

which  is  Eq.  (33),  except  that  the  elements  of  the  matrix  S  and  the  vector 
a  are  different.   In  this  case,  by  Eqs.  (40,  (43),  and  (44),  the  respective 
elements  are  given  by: 


1  s 


/K.(t)  K.(t)dt  +   /    K.(t)  K.(t)dt 


S^.  =   /    K. (t)  K,(t)dt  +   /    K, (t)  K,(t)dt  (46) 


T  ; 


and  t^ 


1     t^-t^  j 


(t)dt,  (47) 


for  i,j=l,...,n.   The  vector  c  given  by  (45)  is  the  required  coefficient 
vector,  which  when  applied  to  Eq .  (40)  provides  the  best  approximation  to 
a  rectangular  unit  function  by  the  averaging  kernel. 

When  (40)  is  applied  to  (39)  and  the  second  mean-value  theorem  for 
integrals  is  applied,  one  has 
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c.r^  =  ak  X  <-^>   /    R(c,t)dt 


^=1  t. 


=  ak  X  <|^>  (t2-t^)  .  (48) 


But  by  the  law  of  the  mean  for  derivatives 


V(t„)  -  V(t^)     .;J 


so  that  the  required  geostrophic  wind  components  by  (48)  have  the  form 


(50) 


u(t^)  =  u(t2)  +  <|^>  (t^-t^)  =  u(t2)  +^^  c.  (^)p. 

i=l 

and 

"      9R 
v(t,)  .  v(tp  +  <|^>  (t,-tp  .  v(t,)  -  i^  c.  (^)p.       (51) 

i=l 


The  interpretation  of  Eqs.  (50)  and  (51)  is  (as  before)  that  the  wind  com- 
ponents at  level  t^  are  inferred  from  those  given  at  level  t^  plus  the 
linear  combination  of  components  of  the  radiance  gradients.   Note  that, 
aside  from  a  difference  in  sign,  Eqs.  (50)  and  (51)  are  identical  to  Eqs. 
(36)  and  (37),  as  they  should  be  since  the  starting  points,  namely  Eqs.  (8) 
and  (9),  are  just  different  forms  of  the  same  equation.   However,  this 
numerical  exercise  has  produced  a  large  gain  in  that  all  optimal  pressure 
intervals  are  found  at  one  time  as  the  solution  to  the  problem  given  by 
Eqs.  (45),  (46),  and  (47). 
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6 .   RESULTS 

The  two  tests  described  in  Section  4  were  also  applied  to  the  rectangu- 
lar averaging  kernels  of  Eq.  (40).   All  details  and  conclusions  of  the 
tests  in  Section  4  remain  the  same,  including  the  kinds  and  numbers  of 
channels  used,  the  temperature  dependence  of  the  kernel  functions,  and 
the  combinations  of  pressure  levels  that  were  computed. 

In  the  first  test  three  averaging  kernels  (one  more  than  in  Section  4) 
were  found  to  satisfy  the  criterion  of  "best",  and  are  shown  in  Fig.  4 
with  the  labels  R  ,  R  ,  and  R„ .   They  do  not  have  "rectangular"  shapes 
because  the  kernel  functions  used  in  the  linear  combination  (40)  are  all 
rounded  in  the  manner  of  the  curves  shown  in  Fig.  4.   The  three  curves  in 
Fig.  4  indicate  that  the  best  pressure  intervals  (t  ,t„)  for  the  DMSP 
sounder  are:  10  to  125  mb ,  80-400  mb ,  and  300-750  mb .   For  the  second  test 
the  best  rectangular  averaging  kernels  for  the  TOVS  sounders  are  shown  in 
Fig.  5.   These  five  curves  (two  more  than  in  Section  4)  yield  best  pressure 
intervals  of:  35-150,  115-330,  175-500,  285-625,  and  400-875  mb . 

A  comparison  of  the  unimodal  curves  in  Fig.  4  with  those  in  Fig.  5 
clearly  shows  the  lobes  of  Fig.  5  to  be  narrower  than  those  of  Fig.  4,  with 
the  important  consequence  that  the  sides  of  the  lobes  in  Fig.  5  drop  more 
vertically  than  they  do  in  Fig.  4.   Again,  this  is  to  be  expected  because 
the  additional  channels  that  are  used  to  produce  the  averaging  kernels  in 
Fig.  5  give  more  flexibility  to  the  shape  of  the  averaging  kernels.   Also, 
when  8  HIRS/2  channels  were  used  in  place  of  the  original  11  channels,  the 
conclusions  are  as  before,  that  is,  curve  R,  is  most  strongly  effected, 
curve  R^  is  effected  only  slightly,  and  curves  R^ ,  R^ ,  and  Rq  are  virtually 
unaffected . 
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The  conclusions  of  Sections  4  and  6  as  to  which  pressure  intervals  are 
best  are  conveniently  summarized  in  Table  1.   The  additional  channels  on 
the  TIROS-N  satellite  provided  more  layer  options  than  were  available  from 


Table  1.   Best  layers  over  which  to  calculate  the  average 
vertical  wind  shear,  based  on  the  SSH  and  TOYS 
sounders,  for  the  bimodal  and  unimodal  averaging 
kernels . 


Bimodal  Averaging 
Kernel 

Unimodal  Averaging 
Kernel 

SSH 

12-125  mb 

10-125 

mb 

Sounder 

80-400  mb 

80-400 

mb 

* 

300-750 

mb 

25-150  mb 

35-150 

mb 

TOYS 

80-325  mb 

115-330 

mb 

Sounder 

/'C 

175-500 

mb 

250-625  mb 

285-625 

mb 

* 

400-875 

mb 

Missing  because  only  the  results  from  the  single—constraint 
problem  are  available  (see  the  dicussion  at  the  end  of 
Section  4) . 


the  DMSP  satellite.   Note  the  similarity  between  adjacent  entries  in  the 
two  columns.   The  slight  differences  in  pressure  intervals  can  be  attri- 
buted to  differences  in  the  numerical  procedures  used  to  calculate  the 
averaging  kernels. 

7.   METHODS  FOR  DERIYING  WINDS 

Now  that  the  diagnostic  studies  have  been  completed,  we  can  discuss 

the  main  issue,  namely,  how  can  one  derive  winds  at  one  pressure  level. 
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given  the  winds  at  a  second  level  plus  the  gradients  of  the  radiances.   One 
method  has  been  discussed  in  Sections  3  and  5,  but  other  methods  are  also 
possible.   Brief  discussions  of  three  distinct  methods  are  now  presented. 
A.   Averaging  Kernels 

The  most  obvious  method  to  use  is  the  one  described  in  Sections  3 
and  5.   In  short,  one  prescribes  pressure  levels  t   and  t„  and  then  uses 
Eqs.  (36)  and  (37),  or  equivalently ,  Eqs.  (50)  and  (51).   However,  when 
these  equations  are  applied  to  real  data,  the  noise  in  the  measured  radiances 
must  be  taken  into  account.   A  solution  to  this  problem  is  given  by  Backus 
and  Gilbert  (1970).   Briefly,  one  must  either  derive  a  covariance  matrix  E 
for  the  measurement  error,  or  model  one  from  certain  assumptions  about  the 
behavior  of  the  noise.   Then  the  variance  of  the  solution  due  to   the  random 
errors  of  measurement  is  of  the  form 

a^(c)  =  c"^  Ec.  (52) 

Now  instead  of  minimizing  quadratic  form  (18)  or  (43)  ,  one  must 
minimize  the  linear  combination  of  quadratic  forms 

qQ(c)  +  (1-q)  a^(c),  (53) 

where  0  <  q  <  1.   Expression  (53)  must  also  be  subject  to  the  proper  con- 
straints, which  can  be  (19)  and  (20);  or  just  (19);  or  just  (44).   The 
choice  of  the  value  for  the  scalar  q  depends  upon  the  tradeoff  arrangement 
one  wants  between  minimizing  noise  and  the  accuracy  of  the  shape  of  the 
averaging  kernel.   The  tradeoff  decision  depends  in  turn  upon  the  particular 
application  at  hand.   The  best  pressure  levels  to  apply  to  this  method  are 
listed  in  Table  1. 

This  method  should  be  used  with  real  data  only  if  the  following 
two  assumptions  apply; 
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1.  One  requires  only  geostrophic  winds,  not  the  true  winds,  and 

2.  The  transmittance  functions  of  the  radiative  transfer  equation 
are  known  with  reasonable  accuracy. 

B.   Vertical  Profile  Retrievals 

One  really  need  not  go  any  further  than  Eq .  (8)  or  (9).   Since  both 

of  these  equations  require  quantities  that  are  known  and  available,  namely, 

well-defined  kernel  functions  and  n  vectors  of  V  R, ,  one  can  retrieve 

P  1 

vertical  profiles  of  the  wind  shear  from  Eq.  (8)  or  vertical  profiles  of 
the  geostrophic  wind  vector  from  Eq.  (9).   These  kinds  of  retrieval  methods 
are  described  in  a  paper  by  Fleming  and  Smith  (1967),  and  are  applicable 
because  Eqs .  (8)  and  (9)  are  linear  Fredholm  integral  equations  of  the 
first  kind. 

When  Eq .  (9)  is  used> the  wind  profile  is  available  directly,  but 
since  it  will  be  accurate  in  only  a  relative  sense,  one  should  take  the 
difference  6V  between  the  wind  vector  profile  at  levels  t,  and  t„  and  apply 
this  difference  to  the  first  parts  of  Eqs.  (13)  and  (14),  provided  the  wind 
components  u(t^)  and  v(t„)  are  known. 

If  Eq .  (8)  is  used,  then  the  retrieved  vertical  wind  shear  profile 

->- 

is  applied  to  Eq.  (41)  to  determine  the  average  wind  shear  <-?r— >  between 

at 

levels  t^  and  t„.   This  average  value  is  then  applied  to  the  first  parts  of 
Eqs.  (50)  and  (51),  provided  the  wind  components  u(t  )  and  v(t^)  are  known. 
Again,  the  best  levels  at  which  to  apply  this  method  are  listed  in  Table  1. 
The  same  two  assumptions  that  were  listed  under  Method  A  above  apply  here 
as  well,  except  that  since  this  is  an  inverse  method,  the  transmittance 
functions  of  the  radiative  transfer  equation  must  be  known  with  even 
greater  accuracy  than  that  required  for  Method  A. 
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C.   Regression 

When  true  winds  are  required  (as  opposed  to  thermal  winds) ,  one  of 
the  most  appropriate  methods  is  ordinary  regression  in  which  the  radiance 
gradients  are  used  as  the  predictors.   To  carry  out  this  procedure,  one 
needs  to  compile  a  large  sample  of  true  wind  data  which  are  coincident  in 
time  and  place  with  satellite  radiance  measurements  from  which  horizontal 
gradients  can  be  calculated.   The  necessary  regression  coefficient  matrix 
can  then  be  computed  from  matrices  of  these  coincident  measurements.   This 
method  is  standard,  and  so  details  will  not  be  given  here.   One  can  find 
specifics  in  the  paper  by  Fleming  and  Smith  (1967).   The  best  layers  given 
in  Table  1  apply  to  the  regression  method  as  well  because  those  layers  are 
the  very  ones  over  which  the  radiance  gradients  are  the  most  sensitive  as 
predictors. 

8.   CONCLUSIONS 

The  final  shapes  of  the  averaging  kernels  in  Figs.  2  through  5  are 
not   ideal.   One  reason  for  this  is  that  there  are  too  few  kernel  functions 
to  work  with,  but  a  far  more  important  reason  is  that  one  cannot  dictate 
the  shapes  of  the  individual  kernel  functions  because  they  are  prescribed 
by  the  physical  properties  of  the  atmosphere.   Consequently,  they  are  not 
even  linearly  independent  (that  is,  relative  to  the  noise  level  of  the 
measurements),  and,  therefore,  are  not  very  effective  basis  functions. 

The  use  of  the  words  "optimum"  and  "best"  in  this  report  needs  clarifi- 
cation.  For  a  given  set  of  pressure  levels  t^  and  t_  the  corresponding 
averaging  kernel  is  optimum  in  that  it  minimizes  the  defining  quadratic 
form.   Thus,  there  is  one  optimum  averaging  kernel  for  each  pair  of  given 
pressure  levels.   But  then  out  of  all  of  these  optimum  averaging  kernels 
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the  best  ones  were  chosen  to  be  the  ones  which  had  the  truest  main  lobe 
shape  and  the  smallest  side  lobe  amplitude. 

Even  though  several  methods  for  calculating  winds  were  presented, the 
regression  approach  to  obtaining  true  winds  should  be  the  most  useful 
approach.   In  pursuing  this  approach  several  recommendations  can  be  made 
that  result  directly  from  this  study: 

1.  As  many  channels  as  possible  should  be  used  as  predictors,  even 
though  a  particular  kernel  function  may  contribute  only  weakly  to 
the  level  of  the  atmosphere  in  question. 

2.  Winds  cannot  be  inferred  with  equal  accuracy  at  all  levels  of  the 
atmosphere.   Since  the  preliminary  work  of  deciding  which  pressure 
levels  yield  the  most  accurate  winds  has  been  done  in  this  report 
(when  radiance  gradients  are  used  as  predictors),  one  need  only 
refer  to  Table  1  to  decide  on  the  pressure  levels  at  which  to 
collect  wind  data. 

3.  The  diagnostic  tools  developed  in  this  report  are  useful  in  help- 
ing to  answer  questions  about  the  sensitivity  of  the  solution  to 
a  given  channel  or  about  the  results  that  can  be  expected  if  an 
instrument  other  than  the  SSH  or  TOVS  sounders  is  used  (e.g.,  a 
microwave  sounder) . 
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